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LOCAL PROCESS VARIATION CORRECTION 
FOR OVERLAY MEASUREMENT 
Weidong Yang 
Roger R. Lowe- Webb 
5 Silvio Rabello 

FIELD OF THE INVENTION 

The present invention relates to overlay metrology, and in particular to an overlay target 
and method of measurement that corrects for local process variations in the overlay error. 

10 

BACKGROUND 

Semiconductor processing for forming integrated circuits requires a series of processing 
steps. These processing steps include the deposition and patterning of material layers such as 
insulating layers, polysilicon layers, and metal layers. The material layers are typically 

15 patterned using a photoresist layer that is patterned over the material layer using a photomask 
or reticle. Typically, the photomask has alignment targets or keys that are aligned to fiduciary 
marks formed in the previous layer on the substrate. However, as the size of integrated circuit 
features continues to decrease, it becomes increasingly difficult to measure the overlay 
accuracy of one masking level to the previous level. This overlay metrology problem becomes 

20 particularly difficult at submicrometer feature sizes where overlay alignment tolerances are 
reduced to provide reliable semiconductor devices. 

One type of overlay measurement that has recently been developed is based on the 
diffraction of light by a number of alignment pads. Fig. 1 shows one example of a diffraction 
based overlay measurement pattern 10. As illustrated in Fig. 1, pattern 10 includes two 

25 separate alignment pads 20 and 30. The pads are made of the materials that are deposited onto 
the substrate. Each pad 20, 30 includes a bottom layer 12, a bottom diffraction grating 24, 34, a 
top layer 16, and a top diffraction grating 28, 38. The bottom layer 12 may be, e.g., one or 
more films on a substrate. The bottom diffraction gratings 24, 34 may be, e.g., metal lines 
surrounded by dielectric material. The top layer 16 may be, e.g., one or more films disposed 



-1- 



over the substrate and the bottom diffraction grating material. The top diffraction gratings 28, 
38 may be, e.g., metal lines, metal lines surrounded by dielectric material, photoresist lines, etc. 

As illustrated in Fig. 1, the top diffraction gratings 28, 38 are intentionally offset from 
the bottom diffraction gratings 24, 34. In addition, the offsets of pads 20 and 30 are equal in 
5 magnitude but opposite in direction. With such an offset configuration, a misalignment to the 
left or right in Fig. 1 can be easily detected using pattern 10. 

The diffraction based metrology pattern 10 relies on the symmetry of the alignment 
pads 20 and 30. When the symmetry is broken by an alignment error, the diffraction patterns 
produced by each alignment pad will change by different amounts. Fig. 2, by way of example, 
10 illustrates metrology pattern 10 with an alignment error to the right in the figure, which as can 
be seen, creates an asymmetry between the pads 20 and 30. By comparing the diffraction 
patterns from pads 20 and 30, the amount of alignment error can be determined. 

In the presence of local process variations, however, diffraction based overlay 
measurement may be biased. Local process variations are created during processing of the 
15 measurement pattern 10, e.g., during chemical mechanical polishing, and result in differences 
in film thickness, grating height or grating linewidth. Local process variations may cause an 
asymmetry between the alignment pads 20 and 30 even if there is no alignment error. 
Accordingly, when there is an alignment error, the difference in collected spectra from pads 20 
and 30 contains information from both the overlay shift and the process variations. These 
20 contributions cannot be directly separated. 

Thus, what is needed is an improved diffraction based metrology technique that can 
correct for local process variations when measuring the overlay error. 

SUMMARY 

25 A diffraction based overlay metrology method and apparatus, in accordance with the 

present invention, determines overlay error independent of local process variations. The 
contributions from process variations are removed by measuring the overlay pattern before and 
after the top diffraction gratings are formed. A plurality of differential spectra from the 
measurement locations of the completed overlay pattern can then be used with a plurality of 
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ratios of differential spectra from measurement locations of the incomplete overlay pattern to 
determine the overlay error by either direct calculation or by fitting techniques. 

In one embodiment of the present invention, a method of measuring an overlay error 
that is corrected for process variations includes producing an incomplete overlay pattern, which 
5 has a plurality of measurement locations. The incomplete overlay pattern has a diffraction 
grating at each measurement location. The plurality of measurement locations of the 
incomplete overlay pattern are illuminated with incident radiation that reacts with the 
diffraction gratings and the resulting radiation is then detected. The overlay pattern is then 
completed by forming top diffraction gratings over the previously formed bottom diffraction 

10 gratings. The top diffraction gratings are formed with a designed in offset from the bottom 
diffraction gratings, i.e., the top and bottom diffraction gratings are intentionally offset by a 
known amount when there is no overlay error. The plurality of measurement locations are then 
illuminated again with incident radiation and the resulting radiation is detected. The detected 
radiation from the plurality of measurement locations of the incomplete overlay pattern and the 

15 detected radiation from the plurality of measurement locations of the completed overlay pattern 
are then used to determine the overlay error. In one embodiment, the detected radiation from 
the incomplete overlay pattern and the detected radiation from the completed overlay pattern 
are used to generate a plurality of ratios of differential spectra from measurement locations of 
the incomplete overlay pattern and a plurality of differential spectra from measurement 

20 locations of the completed overlay pattern. The plurality of ratios and the plurality of 

differential spectra are then used to determine the overlay error, e.g., by direct calculation or by 
fitting. 

Another embodiment of the present invention is a method of determining the overlay 
error, which is corrected for process variations, in an overlay pattern having a plurality of 

25 measurement locations, each measurement location having a bottom diffraction grating and a 
top diffraction grating, each top diffraction grating overlies a corresponding bottom diffraction 
grating and has a designed in offset, wherein a plurality of measurement locations have 
different designed in offsets. The method includes generating a first plurality of differential 
spectra, the spectra being measured from measurement locations of the overlay pattern prior to 

30 producing the top diffraction gratings and generating a second plurality of differential spectra, 



the spectra being measured from measurement locations of the overlay pattern after producing 
the top diffraction gratings. The first plurality of differential spectra and the second plurality of 
differential spectra are then used to determine the overlay error. In one embodiment, the first 
plurality of differential spectra is used to generate a plurality of ratios of differential spectra 
that are used to determine the overlay error along with the second plurality of differential 
spectra. 

Another embodiment includes an apparatus for determining overlay error that is 
corrected for process variations in an overlay pattern having a plurality of measurement 
locations, each measurement location having a bottom diffraction grating and a top diffraction 
grating, each top diffraction grating overlies a corresponding bottom diffraction grating and has 
a designed in offset, wherein a plurality of measurement locations have different designed in 
offsets. The apparatus includes a radiation source for producing radiation to be incident on the 
plurality of measurement locations of the overlay pattern and a detector for detecting the 
spectra after the radiation interacts with the measurement locations of overlay pattern. The 
apparatus also includes a computer and a computer-usable medium having computer-readable 
program code embodied therein for causing the computer to perform the acts of generating a 
first plurality of differential spectra, the spectra being detected from measurement locations of 
the overlay pattern prior to producing the top diffraction gratings and generating a second 
plurality of differential spectra, the spectra being detected from measurement locations of the 
overlay pattern after producing the top diffraction gratings. The first plurality of differential 
spectra and the second plurality of differential spectra are then used to determine the overlay 
error. In one embodiment, the computer-readable program code further causes the computer to 
perform the act of generating a plurality of ratios of differential spectra using the first plurality 
of differential spectra and using the plurality of ratios to determine the overlay error along with 
the second plurality of differential spectra. 

Another aspect of the present invention includes a method that includes providing an 
overlay pattern having a plurality of measurement locations, each measurement location 
includes a bottom diffraction grating and a top diffracting grating that overlies the bottom 
diffraction grating and has a designed in offset from the bottom diffraction grating. The 
plurality of measurement locations are illuminated with incident radiation that reacts with the 



diffraction gratings and the radiation is detected after reacting with the diffraction gratings. 
The overlay error between the bottom diffraction gratings and the top diffraction gratings is 
determined using the detected radiation from the measurement locations from the overlay 
pattern, and the overlay error is corrected for effects of local process variations created during 
5 processing of the overlay pattern using the detected radiation from at least one pair of the 
measurement locations from the overlay pattern. In one embodiment, the method includes 
illuminating the plurality of locations before completing the overlay pattern, i.e., illuminating 
each location of the incomplete overlay pattern, and detecting the radiation. Determining the 
overlay error and correcting the overlay error are performed using the detected radiation from 

10 both the complete and incomplete overlay pattern. In another embodiment, the overlay pattern 
includes at least a first pair of the measurement locations have a designed in offset of a first 
magnitude and at least a second pair of the measurement locations have a designed in offset of 
a second magnitude, where the correcting the overlay error uses the detected radiation from the 
first pair of measurement locations. The embodiment may further include using the second pair 

15 of measurement locations to correct the overlay error. The first pair may have designed in 
offsets in equal or opposite directions and the second pair has designed in offsets in the 
opposite directions. In one embodiment, correcting the overlay error for effects of local 
process variations created during processing of the overlay pattern is performed while 
determining the overlay error. 

20 In another aspect of the present invention, a method includes providing an overlay 

pattern having at least four measurement locations, each measurement location having a bottom 
diffraction grating and a top diffracting grating that overlies the bottom diffraction grating and 
has a designed in offset from the bottom diffraction grating, at least two pairs of the 
measurement locations have the same magnitude designed in offset. The method includes 

25 illuminating each of the measurement locations of the overlay pattern with incident radiation 
that reacts with the diffraction gratings and detecting the radiation from the measurement 
locations after reacting with the diffraction gratings. The overlay pattern is determined using 
the detected radiation from the measurement locations, and the overlay error is corrected for 
effects of local process variations using the detected radiation from at least one pair of the 

30 measurement locations from the overlay pattern. In one embodiment, correcting the overlay 
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error for effects of local process variations created during processing of the overlay pattern is 
performed while determining the overlay error. 



BRIEF DESCRIPTION OF THE DRAWINGS 
5 Fig. 1 shows one example of a diffraction based overlay measurement pattern. 

Fig. 2 illustrates the metrology pattern of Fig. 1 with an alignment error. 

Fig. 3 is a flow chart illustrating an embodiment of the present invention of measuring 
an overlay error that is corrected for any process variations from the fabrication of the overlay 
pattern. 

10 Fig. 4 is a side view of one embodiment of an incomplete overlay pattern, i.e., an 

overlay pattern without the top diffraction gratings formed. 

Fig. 5 illustrates a normal incidence polarized reflectance spectrometer that may be used 
to measure the overlay pattern. 

Fig. 6 is a side view of one embodiment of a completed overlay pattern, i.e., an overlay 
1 5 pattern with the top diffraction gratings formed with designed in offsets. 

Fig. 7 is a flow chart that illustrates using the detected radiation from the incomplete 
overlay pattern and the detected radiation from the completed overlay pattern to determine the 
overlay error. 

Fig. 8 is a flow chart that illustrates one embodiment of determining the overlay error 
20 from the plurality of ratios and differential spectra. 

DETAILED DESCRIPTION 

A diffraction based overlay metrology method corrects the effects caused by local 
process variations in accordance with the present invention. In general, diffraction based 
25 overlay measurements include contributions from both the overlay shift as well as process 
variations in the formation of the overlay pattern. By removing the effects of the process 
variations, the accuracy of the measurement of the overlay error is increased. 

Fig. 3 is a flow chart 100 illustrating an embodiment of the process of measuring an 
overlay error that is corrected for any process variations from the fabrication of the overlay 
30 pattern. The method, in accordance with an embodiment of the present invention, includes 



measuring diffracted radiation from the overlay pattern before and after the top diffraction 
grating is produced and using the measured diffracted radiation to determine the overlay error, 
which is corrected for process variations. Thus, as illustrated in Fig. 3, in step 102 an 
incomplete overlay pattern is produced, i.e., the overlay pattern is produced without the top 
5 diffraction gratings. 

Fig. 4 is a side view of one embodiment of an incomplete overlay pattern 200. Overlay 
pattern 200 includes a plurality of measurement locations, sometimes referred to herein as 
grating pads Pi, P 2 , P3 . - . P n , where n represents the number of the last pad. While Fig. 4 
shows four pads, it should be understood that additional pads may be used if desired. 

10 Each grating pad includes a bottom layer 202, a bottom diffraction grating 214, 224, 

234, and 244, and a top layer 216. As can be seen in Fig. 4, the top layer 216 varies in 
thickness from one pad to the next. The variation in the thickness of the top layer 216 is an 
example of local process variations, for which the present invention compensates. The 
variation in the thickness of the top layer 216 from one pad to the next may be caused by, e.g., 

1 5 chemical mechanical polishing. 

As is well understood, the bottom layer 202 may be one or more layers on a substrate or 
the substrate itself. The bottom diffraction gratings are produced from the materials that are 
deposited on the substrate at that level and may be, e.g., metal lines surrounded by dielectric 
material or any other appropriate material. The bottom diffraction gratings 214, 224, 234, 244 

20 have the same parameters, e.g., line height, line width, and pitch. The top layer 216 may be, 
e.g., one or more films disposed over the substrate and the bottom diffraction grating material. 
Preferably, the incomplete overlay pattern 200 is as close to complete as possible, but without 
the top diffraction gratings formed. For example, in one embodiment where the material for 
the top diffraction gratings is transparent or partially transparent, the material for the top 

25 diffraction gratings may be deposited, but not etched, over top layer 216. The processing steps 
of depositing films and etching patterns in the films is well known in the art. 

Referring back to Fig. 3, in step 104 the incomplete overlay pattern 200 is illuminated 
and the resulting radiation is detected from each grating pad P h P 2 , P 3 . . . P n . For example, the 
diffracted spectra may be measured after the incident radiation interacts with the bottom 

30 diffraction grating (as well as the top layer 216 and bottom layer 202). 
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The measurement of the spectra may be performed using a reflectometer, ellipsometer, 
scatterometer or any other appropriate instrument. By way of example, Fig. 5 illustrates a 
normal incidence polarized reflectance spectrometer 300 that may be used to measure a pattern 
100. Spectrometer 300 is discussed in detail in the U.S. Patent Application entitled "Apparatus 
5 and Method for the Measurement of Diffracting Structures," filed September 25, 2000, having 
Serial No. 09/670,000, which has the same assignee as the present disclosure and is 
incorporated herein by reference. 

As shown in Fig. 5, spectrometer 300 is similar to a reflectometer, which is well known 
in the art. Spectrometer 300 includes a polychromatic light source 302 that generates a light 

10 beam that is partially reflected by beam splitter 304 along the optical axis 303. The light beam 
is directed towards a sample 306 having an overlay pattern, such as pattern 200, to be 
measured. Sample 306 may be, e.g., a semiconductor wafer or flat panel display or any other 
substrate, and is supported by a stage 308, which may be a polar coordinate, i.e., R-0, stage or 
an x-y translation stage. Spectrometer 300 includes a rotatable polarizer 310 and a lens 312 (or 

15 series of lenses) to polarize and focus the light beam onto the sample 306 at normal incidence. 
The beam is reflected off sample 306 and the reflected light is transmitted through lens 3 12 and 
polarizer 310. A portion of the reflected light is transmitted through beam splitter 304 and is 
received by a spectrophotometer 314. Spectrophotometer 3 1 4 is coupled to processor 3 1 6, 
which analyzes the data provided by spectrophotometer 314. Processor 316 is e.g., a computer 

20 with a computer-usable medium having computer-readable program code embodied therein for 
causing the computer to determine the overlay error based on the light that is diffracted from 
the overlay pattern as will be described herein. One of ordinary skill in the art can program 
code capable of determining the overlay error that is corrected for local variations in 
accordance with the present invention in view of the present disclosure. For more information 

25 on the general operation of a normal incidence polarized reflectance spectrometer, the reader is 
referred to Serial Nos. 09/670,000, which is incorporated herein by reference. 

Because the top diffraction gratings have not been formed on pads Pi, P 2 , P3 . . . P n , the 
pads should be identical in the present embodiment, except for any local process variations, 
illustrated in Fig. 4 as the variation in thickness of the top layer 116 from one pad to the next. 
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Thus, differences in the detected radiation, e.g., spectra, from each pad in overlay pattern 200 
are caused by process variations. 

After the incomplete overlay pattern 200 is measured in step 104 of Fig. 3, the overlay 
pattern 200 is then completed by fabricating the top diffraction gratings (step 106). The 
5 completed overlay pattern 200' is shown in Fig. 6. The completed overlay pattern 200' is the 
same as incomplete overlay pattern 200 (shown in Fig. 4), but with the addition of the top 
diffraction gratings 218, 228, 238, and 248 on pads Pi, P 2 , P3, P n respectively. The top 
diffraction gratings 218, 228, 238, and 248 are produced such that they are intentionally offset 
by a known amount from the corresponding bottom diffraction gratings when there is no 
10 overlay error. This intentional offset is referred to herein as a designed in offset. 

As illustrated in Fig. 6, the pads Pi, P 2 , P3 . . . P n have designed in offsets Di, D 2 , D 3 . . . 
D n between the top and bottom diffraction gratings. The designed in offsets may be different 
for each pad Pi, P 2 , P3 . . . P n . hi some embodiments, some of the pads may have equal but 
opposite designed in offsets, i.e., the offsets have the same magnitude but opposite signs, e.g., 
1 5 D 2 =-D 3 . Further, in some embodiments, some of the designed in offsets may be identical, e.g., 
D 2 =D 4 . 

Typically, during the fabrication of the layer that includes the top diffraction gratings, 
there is an overlay error which is caused, e.g., by improper alignment between the mask and the 
underlying gratings. The overlay error is, thus, evidenced by a change in the alignment of the 
20 top diffraction gratings with the bottom diffraction gratings. The overlay error e is the same for 
each of the pads in the overlay pattern. The resulting total overlay shift Oi, 0 2 , 0 3 . . . O n for 
each pad is, thus, equal to the designed in offset plus the overlay error e, Di+e, D 2 +e, D 3 +e . . . 
D n +e. 

It should be understood that Fig. 6 illustrates an overlay pattern 200' that can be used to 
25 measure the overlay error in one direction, i.e., parallel with the page. An additional overlay 
pattern that is orthogonal to overlay pattern 200' would be used to measure the overlay error in 
another direction, i.e., perpendicular to the page. If desired, a single overlay pattern that uses 
bi-directional gratings may be used in accordance with the present invention. 

Once the overlay pattern 200' is completed, the overlay pattern 200' is then illuminated 
30 and the resulting radiation is detected from each grating pad P u P 2 , P 3 . . . P n in step 108 of Fig. 
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3. As described in reference to step 104, the diffracted spectra may be measured from each 
pad. The resulting radiation that is detected includes contributions from any process variations 
as well as the overlay shift, which includes the designed in offset and the overlay error. 

As shown in step 1 10 of Fig. 3, the detected radiation from the incomplete overlay 
5 pattern 200 and the detected radiation from the completed overlay pattern 200' are then used 
together to determine the overlay error, which is independent of process variations. The 
overlay error is corrected for process variations by using the detected radiation from the 
incomplete overlay pattern 200, which includes contributions from process variations alone, 
and using the detected radiation from the completed overlay pattern 200', which includes 

10 contributions from both the process variations and the overlay shift. 

Fig. 7 is a flow chart 400 that illustrates using the detected radiation from the 
incomplete overlay pattern 200 and the detected radiation from the completed overlay pattern 
200' to determine the overlay error. As shown in step 401, a plurality of differential spectra 
from the incomplete overlay pattern is generated. The differential spectra from the pads of the 

1 5 incomplete overlay pattern 200 have the following relationships: 

S i -S j =P(E i -E J ) eq.l 

S k -S t =P(E k -E g ) eq.2 

where i, j f k, and / are integers representing the number of the measured pad, S represents the 
spectra measured from a pad of incomplete overlay pattern 200, E represents the parametrical 
20 process variation of the measured pad, and P represents the unit spectral contribution from 
processing before forming of the top diffraction gratings. A plurality of ratios of differential 
spectra from the incomplete overlay pattern can be generated (step 402), as follows: 

— — = . eq.3 

S k~ s i E k~ E i 

A plurality of differential spectra from the completed overlay pattern 200' is generated 
25 in step 404 in Fig. 7. The spectra from the pads of the completed overlay pattern 200' have the 
following relationship: 
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T t - Tj = Q{E t -Ej) + OvliO; - Oj ) eq.4 



where T represents the spectra measured from a pad of completed overlay pattern 200', O 
represents the total overlay shift of the measured pad, Q represents the unit spectral 
contribution from processing after forming of the top diffraction gratings, and Ovl represents 
5 the unit spectral contribution from the overlay shift. 

Using the plurality of ratios of the differential spectra from the incomplete overlay 
pattern 200 and the plurality of differential spectra from the completed overlay pattern 200', the 
overlay error e can be determined corrected for process variations (step 406). In one 
embodiment, the overlay error e is determined by direct calculation using the measured spectra 
10 from the incomplete overlay pattern 200 and the completed overlay patter 200', as well as the 
known designed in offsets. In another embodiment, the overlay error e is determined by curve 
fitting the differential spectra. 

In one embodiment, in which the overlay error can be directly calculated based on the 
measured radiation, two of the pads Pi and Pj have the same designed in offset and therefore the 
15 same total overlay shift. Table 1 shows an example of the present embodiment. 





Pad 1 


Pad 2 


Pad 3 


Pad 4 


Designed in Offset 


-D 


+D 


D+d 


+D 


Total Overlay Shift 


-D+e 


D+e 


D+d+e 


D+e 


Parametrical process variation 


Ei 


E 2 


E 3 


E 4 


Spectra (Before Top Gratings) 


Sj 


S 2 


s 3 


s 4 


Spectra (After Top Gratings) 


T, 


T 2 


T 3 


T 4 



TABLE 1 



The term D represents the magnitude of the offset and the term d represents the magnitude of a 
reference offset. In general, the offset D, may be 5% to 40% of the pitch, and more particularly 
25%, while the reference offset d can be 1% to 15% of the pitch, e.g., 5% of the pitch. 
20 However, if the reference offset d is too small, the reference offset d will be lost in the noise, 
while if too large, the reference offset may lose linearity. 

From equation 3 above, in the present embodiment, the spectra from the pads before the 
top diffraction gratings are formed have the following relationships-: 
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E,-E 2 =(E 4 -E 2 )?^k. eq.6 

From equation 4 above, the spectral difference between pads can be calculated, where 
spectral difference between pads 2 and 4 (after the top diffraction gratings are formed) will 
contain contributions only from process variations because the total overlay shift O of the two 
pads are the same. 

T 4 -T 2 =Q(E 4 ~E 2 ) eq.7 
The spectral difference between any other combinations of pads, however, will contain 
information relating to both the process variations and the overlay shift, e.g., 

T 3- T 2= Q( E 3 -Ei) + d * Ovl . eq. 8a 
Equation 8a can be rewritten, using equation 6, as: 

T 3 -T 2 =Q(E 4 -E 2 )^^ + d*Ovl 

4 2 eq.8 
= (T 4 -T 2 )f-± + d*Ovl. 

The term d*Ovl is a reference spectra and may be written as a function of measured spectral 
values as: 

d*Ovl = (T 3 -T 2 )-(T 4 -T 2 )^^. eq.9 

The reference spectra can be used to determine the overlay error e once the spectra from 
the overlay error is also determined. To determine the spectra from the overlay error, the 
spectral relation between, e.g., pads 1 and 2 may be analyzed. As indicated above in Table 1, 
the total overlay shift (designed in offsets plus overlay error) for pads 1 and 2 is-D+e, and D+e, 
respectively. Due to symmetry, the spectrum acquired from pad 1 with offset -D+e is 
equivalent to a spectrum acquired from a pad with shift +D-e; that is, the relative offsets of 
padsl and 2 becomes D+e - (-D+e) = D+e - (+D-e) = 2e. The spectral relations of pads 1 and 
2 can be written as follows. 

S 2 -S, =P(E 2 -E r ) eq. 10 
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T 2 -T x =Q(E 2 -E x ) + 2e*Ovl 

= Q(E 4 -E 2 )^± + 2e*Ovl eq. 11 
A 4 -o 2 

= (r 4 -r 2 )|^ + 2**Ov/ 

The term 2e*0v/ is the error spectra, which may be written as a function of measured spectral 
values as: 

2e * Ovl = (r 2 - Tl )-(T 4 -T 2 ) eq. 12 

Combining the overlay error spectra from equation 12 with the reference spectra from 
equation 9 produces: 

s 2 -s l 



(T 2 -T X )-(T 4 -T 2 ) 



2e_ " "S 4 -S 2 

d (T 3 -T 2 )-(T 4 -T 2 ) S '~ S > 



eq. 13 



S 4 -S 2 

Accordingly, the overlay error e is found to be: 

(r 2 -7;)-(7;-:r 2 )|^ 

e = — * — ea. 14 

2 ( r 3 -r 2 )-(r 4 -r 2 )^^ 

As can be seen in equation 14, the overlay error e can be written as a function of the spectra 
measured from the pads before and after the top diffraction grating is formed and as a function 
of the designed in overlay shift d. Thus, the overlay error e can be determined using the 
plurality of differential spectra from the incomplete overlay pattern 200 (or a plurality of the 
ratio of differential spectra) and a plurality of differential spectra from the completed overlay 
pattern 200' as provided in equation 14 (step 406 in Fig. 7). 

It should be understood that the subscripts herein can be generalized to any of the pads 
if desired. It should also be understood that, while the present embodiment is described with 
pads 1 and 2 having designed in offsets of equal but opposite magnitudes, the pads 1 and 2 may 
have designed in offsets of different magnitudes if desired. Further, additional pads may be 
used and additional or different ratios may be used. Useful ratios and pad configurations that 
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may be used with the present embodiment are discussed in U.S. Serial No. 10/1 16,863, which 
is incorporated herein. 

In another embodiment of the present invention, the overlay error e may be corrected 
from local process variations in the overlay pattern without the use of pads that have the 
identical designed in offsets, i.e., pads 4 and 2 do not have the same designed in offsets in 
Table 1 . Table 2 below describes one example of such an embodiment. 



Pad: 


P. 


P 2 


P 3 


P 4 


Designed Offset 


-D-d 


-D 


D 


D+d 


Offset with Error 


-D-d+e 


-D+e 


D+e 


D+d+e 


Process 


Ei 


E 2 


E 3 


E 4 


Spectra, no top grating 


s, 


s 2 


s 3 


s 4 


Spectra, with top grating 


T, 


T 2 


T 3 


T 4 



TABLE 2 

Using the description provided in Table 2, the spectra from the pads, before the top 
diffraction gratings are formed, may be written as follows, where b is a constant representing 
the spectrum when there is no process shift. 



S x =E l P + b 

5 2 =E 2 P + b 

5 3 = E 3 P + b 

5 4 = E,P + b 



eq. 15 



As stated above: 



Si-Sj 



E,-E, 



eq. 16 



S m -S n E m -E„ 

m n m n 

where i, j, m, and n are integers representing the pad numbers, e.g., from 1 to 4. 

The spectra from the pads described in Table 2, after the top diffraction gratings are 
formed, may be written as follows, where c is a constant representing the spectrum when there 
is no process contribution to the spectrum. 
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7; = (d-e)Ovl + E x Q + c 

t = -eOvl + E 2 Q + c 

2 2 * eqs. 17 

T 3 =eOvl + E 2 Q + c 

T 4 = (d + e)Ovl + E 4 Q + c 

Using the values from equation 17, a plurality of differential spectra 7}- 7) may be written as 
follows. 

T l -T 2 =dOvl + (E l -E 2 )Q 
T 4 -T 3 =dOvl + (E 4 -E 2 )Q 

eqs. 18 

T 4 -T x =2eOvl + (E 4 -E x )Q 
T 3 -T 2 =2eOvl + (E 3 -E 2 )Q 

Using the above equations, the overlay error e can be determined by either direct 
calculation or by fitting. By way of example, in one embodiment, the overlay error e can be 
directly calculated by deriving a ratio of the overlay error e versus the reference offset d. This 
embodiment is particularly useful when the process variation is significant and the spectral 
difference is detectable before the top gratings are formed. 

Combining a ratio from equation 16 with one of the differential spectra values in 
equation 18, e.g., T 4 -T$, produces: 

(7 4 - r 3 )fz^ = dOvl^- + (£ 4 - E 3 )^-Q 
o 4 — o 3 o 4 — o 3 o 4 — o 3 

(T 4 -T 3 )^^ = dOvl^^HE 4 -E 3 )^-^-Q eqs. 19 

By substituting in the differential spectra value T r T 2 from equations 18 into equation 19, the 
following is obtained. 

P* - T * ) " (1 " 7 a ) = ^(f^ - 1) eq. 20 

Similarly, using the differential spectra values T 4 -T] from equation 18 and a ratio from equation 
16 produces: 

(T 4 - Tl )?^ = 2eOvl?^ + (E 3 -E 2 )Q eq. 21. 
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By substituting in the differential spectra value T 3 -T 2 from equation 18 into equation 21 , the 
following is obtained. 

(T 4 -T 1 )^^-(T 3 -T 2 ) = 2eOvl(^^-l). eq. 22 

Using equations 20 and 22, a ratio of the overlay error e to the reference offset d 9 i.e., 
e/d, can be generated with no unknown terms. Accordingly, the overlay error e may be written 
as the following. 

d (f^-.> 

e ~ — * — L ea 23 

Thus, the overlay error e can be directly calculated (step 406 in Fig. 7) using a plurality of 
differential spectra from the measurement locations of the completed overlay pattern 200' (step 
404 in Fig. 7), a plurality of ratios of differential spectra from measurement locations of the 
incomplete overlay pattern 200 (step 402 in Fig. 7), and the reference offset d. 

In another embodiment of directly calculating the overlay error e, the spectral 
contribution from the processing variations Q and the spectral contribution from the overlay 
shift Ovl are assumed to be orthogonal or near orthogonal, which can be expressed as: 

Using the following differential spectra terms from equations 18: 

T l -T 2 =dOvl + (E,-E 2 )Q e ^ 
T 4 -T 3 =dOvl + (E 4 -E 3 )Q 6q ' 

the spectral contribution Q from processing variations can be written as: 

T x -T 2 +T 3 -T 4 =(£, -E 2 + E 3 -E 4 )Q eq. 26 
The reference spectra dOvl, i.e., the spectra associated with the reference offset d, can 
be determined using the differential spectra T } -T 2 from equation 18. 
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T i -T 2 =dOvl + (E i -E 2 )Q 

= d0vl+ r f'"f 2 . W-Et+Et-EJQ eq.27 

rfOv/ = (r,-r 2 )- ^ (E l -E a +E i -E 4 }Q eq.28 

By inserting equation 26 into equation 28, the reference spectra dOvl can be derived using 
measured spectra as: 

c/Ov/ = (7, -T t )- J s x _ (I, -T 2 + T 3 -r 4 ). eq.29 

The error spectra leOvl, i.e., the spectra associated with the offset error e, can then be 
found in a similar manner, using the differential spectra T 3 -T 2 as follows: 
T 3 - T 2 = leOvl + (£ 3 - E 2 )Q 

= 2e0vl + E 1 -e\\e\-E 4 (El - E > +E i- E ^Q e< l- 30 

= 2e0vl+ Sl -s~ + S s 3 -s 4 ( E i- E 2 +E >- E ^Q 

2eOvl = (T 3 -r 2 )-_Az|l__ (£i _ E2 +Ej _ Ea) q eq . 31 

Inserting equation 26 into equation 31, produces the error spectra leOvl as a function of 
measured spectra. 

2eOvi = (r 3 - t 2 ) - — Jizji_^ (2; - r 2 + r 3 - r 4 ) eq . 32 

By analyzing the ratio of equations 32 and 29, the overlay error e can be solved directly 
as follows. 

e= 7* S -S eq ' 33 

(r '- r ' ) - .,-^5;-5 ) (T '- r ' +r '- r - ) 
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Thus, again, the overlay error e is directly calculated (step 406 in Fig. 7) using a 
plurality of differential spectra from the measurement locations of the completed overlay 
pattern 200' (step 404 in Fig. 7), a plurality of ratios of differential spectra from measurement 
locations of the incomplete overlay pattern 200 (step 402 in Fig. 7), and the reference offset d. 
5 It should be understood, of course, the overlay error e may be directly calculated from 

equivalent expressions to those provided above. The overlay pattern may use additional pads 
and/different configurations, including different designed in offsets than those provided in 
Table 2. Further, the overlay error e may calculated using other types of ratios such as those 
discussed in U.S. Serial No. 10/1 16,863, which is incorporated herein. Nevertheless, in order 
10 to correct for process variations, the expression of the overlay error e uses a plurality of 

differential spectra from the measurement locations of the completed overlay pattern 200', a 
plurality of ratios of differential spectra from measurement locations of the incomplete overlay 
pattern 200. 

In another embodiment, the overlay error e can be determined by fitting the parameters. 

15 One embodiment of fitting, assumes the spectral contributions from the processing variations Q 
and the spectral contribution from the overlay shift Ovl are orthogonal, as discussed above. In 
accordance with the present invention, using the plurality of ratios and differential spectra to 
determine the overlay error e step 406 in Fig. 7 is further specified in Fig. 8. As illustrated in 
Fig. 8, the spectral contribution from the overlay shift Ovl (step 502) is determined from the 

20 plurality of ratios and the plurality of differential spectra. The spectral contribution from the 
overlay shift Ovl, can be determined from the following: 
it\S<-S,\>\S x -S 2 \ 

otherwise: 

25 {Tl -T 2 )?f^-(T A -T 3 ) = dOvl(^^L-l). eq.35 

-i> 2 a, -o 2 

The spectral contribution from processing after the overlay pattern is completed, i.e., Q, 
is also determined (step 504). From equation 26, the spectral contribution Q can be written as 
follows. 
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g- T >- T > +T >- T - eq.36 
E x — E 2 + E 3 — E 4 

The spectral contribution from processing Q is determined from equation 24 using the 
previously determined spectral contribution Ovl. 

In step 506 in Fig. 8, the overlay error e is determined from fitting the differential 
5 spectra T 4 -7] or T 3 -T 2 using the derived spectral contributions Ovl and Q, while floating e 
and E 4 -E x or E 3 ~E 2 in the following equations (from equation 18). 

T A -T x =2eOvl + {E<-E x )Q 
T 3 -T 2 =2eOvl + (E 2 -E 2 )Q ^ 

Using fitting to determine an unknown value is well known in the art. By way of example, one 
method of fitting is as follows. The equation to be fit may be written as 
10 S = aP + bQ eq. 38 

where S, P> and Q are known spectra and a and b are floating variables. Determining a and b is 
equivalent to finding the minimum of the following: 

F{a 9 b) = YSS-<*i+*>Qi)\ eq.39 

i 

Finding the minimum of equation 39 is as follows: 
which may be written as: 

i i i 

Equation 41 may be rewritten as: 

am u +bm n = c, 



eq. 41 



am 12 + 6w 22 = c 2 



eq. 42 



20 where: 
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m 22 = £a 2 



Thus, the terms a and b may be determined according to the following: 



c x m n -c 2 m n 

m i2 ~ m \\ m 22 



eq. 44 



c,m 22 -c 2 m 12 



Alternatively, equation 38 may be solved with any regression method. 

In another embodiment, the overlay error e can be found by direct fitting, where the 
spectral contributions Ovl and Q are again assumed to be orthogonal. Referring back to Fig. 7, 
in this embodiment, a plurality of differential spectra from the measurement locations of the 
completed overlay pattern 200' are generated (step 404 in Fig. 7), e.g., 



Further, a plurality of ratios of differential spectra from measurement locations of the 
incomplete overlay pattern 200 are generated (step 402 in Fig. 7), i.e., 



where m, and n are integers representing the pad numbers. 

Using the plurality of ratios and the plurality of differential spectra from measurement 
locations of the completed overlay pattern, the overlay error e can be determined (step 406 in 
Fig. 7) by floating the terms e f Ovl and Q and finding a best fit for all four equations in 
equation 45. As a result, the overlay error e can be solved. It should be understood that 
because both Ovl and Q are spectra, the fitting is performed for every wavelength, while 
making the overlay error e identical for all the wavelengths. 




dOvl + (E x -E 2 )Q 
dOvl + (E 4 -E 3 )Q 
2eOvl + (E A -E x )Q 
2eOvl + (E 3 -E 2 )Q. 



eq. 45 
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In yet another embodiment, the premeasurement of the spectra S from the overlay 
before the top grating is formed can be obviated with the careful design of the pads in the 
overlay pattern. Referring to Fig. 6, by way of example, two pads may have the same designed 
in offsets, e.g., D2=D 4 , while the other two pads have the same magnitude designed in offset 
but are in opposite directions, i.e., Di=-D 3 . Alternatively, the designed in offsets of the four 
pads may be symmetrical, such that there are two pairs of pads, each pair having the same 
magnitude designed in offset but in opposite directions, e.g., Di=- D 4 and D2—D3. Of course, if 
desired, additional pads may be used. 

Where two pads have the same designed in offsets, the term cQ can be constructed 
directly from the measured spectra T obtained from these two pads without the use of a 
premeasurement. Where the four pads are symmetric, the term cQ can be constructed directly 
from the measured spectra T obtained from the four pads without the use of a premeasurement. 
Assuming orthogonality of Q and Ov , the terms e and Ovl can be then be determined from all 
spectra. In other words, the overlay error is determined using all the pads, while the overlay 
error is corrected for effects of local process variations using the detected radiation from the 
two pads in the former embodiment, and using the four pads in the later embodiment. It should 
be understood that the he correction of the overlay error for local process variations may be 
performed while determining the overlay error. Alternatively, the two acts may be performed 
separately if desired. 

By way of example, based on the orthogonality which may be expressed as: 

£Ov/.#e,=0 eq.47 
equation 18 may be modified as follows: 

(7; -r 2 )-(r 4 -t 3 ) = (e x -e 2 +£ 3 -e 4 )q. eq . 48 

Without losing generalities let i,j=l . . .N. Equation 1 8 may be rewritten as follows: 
{T i -T j )Q = dOvl.Q + (E i -E j )Q.Q = (E i -E j )Q.Q. eq. 49 

Thus, 
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(E x -E 2 ) = 

(£ 4 -£ 3 ) = 



Q*Q 




(T 4 -T 3 ). 


Q 








Q 






(T 3 -T 2 ). 


Q 





eq. 50 



The term Ovl can then be evaluated, which can then be used to determine e. The term Q can be 

determined using equation 20. 

Let 

E,-E, T,-T. 



C, = 

E A -E 3 T 4 -T 3 

c = E ± -E L= T 1 -T L 

2 E 3 -E 2 T 3 -T 2 



eq.51 



From equation 18: 

(r, -T 2 )-C,(T A -T 3 ) = dOvl + (E 1 -E 2 )Q-C x dOvl-C x {E, -E 3 )Q eq. 52 

which with Ci from equation 51 reduces to: 

(r, -T 2 )-C,{T, -T 3 ) = dOvl + (E, -E 2 )Q-C x dOvl-(E x -E 2 )Q 

= (\-C x )dOvl eq53 

10 Also, from equation 18: . 

(J 4 - T x ) - C 2 (T 3 - T 2 ) = (1 - C, )2eOvl. eq. 54 

Combining equation 53 and 54 results in: 

2e_(T 4 -T i )-C 2 (T 3 -T 2 ) 



1-C,, 



eq. 55 

d {T X -T 2 )-C X (T 4 -T 3 ) 



from which it can be seen e can be determined. 
1 5 There are other ways of determining e from these equations. For example, combining 

equation 50 with equation 1 8 results in the following: 
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dOvl = (T x -T 2 )-(E^E 2 )Q; 

dOvl = (T 4 -T,)-(E 4 -E,)Q; ^ 

2eOvl = (T A -T X )-{E A -E x )Q;and ^ 

2eOvl=(T 3 -T 2 )-(E 3 -E 2 )Q. 

All the items on the right side of equation 56 are known, and thus, e can be calculated. 

. Although the present invention is illustrated in connection with specific embodiments 
for instructional purposes, the present invention is not limited thereto. Various adaptations and 
5 modifications may be made without departing from the scope of the invention. For example, it 
should be understood that various equivalent ratios and equations may be used in accordance 
with the present invention. Moreover, the present invention is not limited to the precise overlay 
patterns described. For example, additional pads and/or different designed in offsets may be 
used. Therefore, the spirit and scope of the appended claims should not be limited to the 
1 0 foregoing description. 
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